Poster: zwol Date: Sep 1, 2015 2:17pm
Forum: web Subject: How do I retrieve the original form of a page from the Wayback Machine?

When you retrieve a page from the Wayback Machine via `http://web.archive.org/web/TIMESTAMP/URL_OF_PAGE`, the HTML you get back has been modified from what it originally was: all links, resources, etc. have been adjusted to point into the archive, and there's a big blob of additional HTML to create the Wayback Machine's toolbar.

Is there any way to retrieve a page _without_ those modifications? In other words, I want to see the page exactly as the crawler saw it. This is for automated analysis of the history of a page.

Reply

Poster: DKL3 Date: Sep 1, 2015 2:45pm
Forum: web Subject: Re: How do I retrieve the original form of a page from the Wayback Machine?

Add "id_" after the timestamp in a URL. For example:

http://web.archive.org/web/20150901185758id_/http://www.example.com

Hope this helps. :)
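For the automated analysis mentioned in the question, the tip above could be wrapped in a small helper. This is only a minimal sketch: the function names `raw_snapshot_url` and `fetch_raw_snapshot` are made up for illustration, and the only thing taken from this thread is the URL pattern with "id_" appended to the timestamp.

```python
from urllib.request import urlopen

# Prefix taken from the example URL in this thread.
WAYBACK_PREFIX = "http://web.archive.org/web"


def raw_snapshot_url(timestamp: str, page_url: str) -> str:
    """Build a Wayback Machine URL with "id_" appended to the timestamp.

    Hypothetical helper: it only does string formatting and does not
    check that a capture exists for this timestamp.
    """
    return f"{WAYBACK_PREFIX}/{timestamp}id_/{page_url}"


def fetch_raw_snapshot(timestamp: str, page_url: str, timeout: float = 30.0) -> bytes:
    """Download the snapshot body as bytes (performs a network request)."""
    with urlopen(raw_snapshot_url(timestamp, page_url), timeout=timeout) as response:
        return response.read()


if __name__ == "__main__":
    # Only prints the URL; call fetch_raw_snapshot() to actually download.
    print(raw_snapshot_url("20150901185758", "http://www.example.com"))
```

Returning bytes rather than decoded text leaves the choice of character encoding to the caller, which matters for older pages that do not declare one.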

Reply

Poster: zwol Date: Sep 3, 2015 11:47am
Forum: web Subject: Re: How do I retrieve the original form of a page from the Wayback Machine?

Thanks, that seems to work. (Sure would be nice if it were *documented* anywhere :-P)

Reply

Poster: slowride13 Date: Sep 29, 2015 9:25am
Forum: web Subject: Re: How do I retrieve the original form of a page from the Wayback Machine?

Does *NOT* work.

Example: looking for the original of my old ISP, mindspring.com. Trying

http://web.archive.org/web/19961219135430id_/http://www.mindspring.com/

or

http://web.archive.org/web/19961219135430/http://www.mindspring.com/

NOTHING, dumps you to mindspring and a redirect to earthlink.

BROKEN!

Reply

Poster: Samuel Bronson Date: May 18, 2019 4:41pm
Forum: web Subject: Re: How do I retrieve the original form of a page from the Wayback Machine?

> http://web.archive.org/web/19961219135430id_/http://www.mindspring.com/
> or
> http://web.archive.org/web/19961219135430/http://www.mindspring.com/
> NOTHING, dumps you to mindspring and a redirect to earthlink.

Happily, these seem to work fine now.
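If a script already holds ordinary Wayback URLs like the second one quoted above, it may be handy to convert them to the "id_" form rather than rebuild them. A rough sketch follows; `to_raw_form` is a hypothetical name, and the regular expression assumes the URL has exactly the shape shown in this thread (a run of digits for the timestamp, optionally followed by "id_").

```python
import re

# Assumed shape: http(s)://web.archive.org/web/<digits>[id_]/<original URL>
_WAYBACK_RE = re.compile(r"^(https?://web\.archive\.org/web/)(\d+)(?:id_)?/(.+)$")


def to_raw_form(wayback_url: str) -> str:
    """Return the same Wayback URL with "id_" after the timestamp.

    URLs that already carry "id_" come back unchanged; anything that does
    not match the assumed shape raises ValueError.
    """
    match = _WAYBACK_RE.match(wayback_url)
    if match is None:
        raise ValueError(f"not a recognised Wayback Machine URL: {wayback_url!r}")
    prefix, timestamp, original = match.groups()
    return f"{prefix}{timestamp}id_/{original}"


if __name__ == "__main__":
    print(to_raw_form("http://web.archive.org/web/19961219135430/http://www.mindspring.com/"))
```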